63 research outputs found

Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs

Author: Chen Danny Z.
Chen Jianxu
Chen Xiaoming
Hu Xiaobo Sharon
Publication date: 29/05/2017

Convolution is a fundamental operation in many applications, such as computer vision, natural language processing, image processing, etc. Recent successes of convolutional neural networks in various deep learning applications put even higher demand on fast convolution. The high computation throughput and memory bandwidth of graphics processing units (GPUs) make GPUs a natural choice for accelerating convolution operations. However, maximally exploiting the available memory bandwidth of GPUs for convolution is a challenging task. This paper introduces a general model to address the mismatch between the memory bank width of GPUs and computation data width of threads. Based on this model, we develop two convolution kernels, one for the general case and the other for a special case with one input channel. By carefully optimizing memory access patterns and computation patterns, we design a communication-optimized kernel for the special case and a communication-reduced kernel for the general case. Experimental data based on implementations on Kepler GPUs show that our kernels achieve 5.16X and 35.5% average performance improvement over the latest cuDNN library, for the special case and the general case, respectively.
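For orientation, the special case with one input channel reduces to a plain 2D sliding-window sum. The sketch below is a naive pure-Python reference for that operation, not the paper's optimized GPU kernel. The function name `conv2d_single_channel`, the stride of 1, and the "valid" (no padding) convention are assumptions made for illustration.

```python
def conv2d_single_channel(image, kernel):
    """Naive 'valid' 2D convolution over one input channel.

    Uses the deep-learning convention (no kernel flip, i.e. cross-correlation).
    Both arguments are lists of equal-length rows; stride is 1, no padding.
    """
    ih, iw = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = ih - kh + 1, iw - kw + 1
    if oh <= 0 or ow <= 0:
        raise ValueError("kernel must not be larger than the image")

    out = [[0.0] * ow for _ in range(oh)]
    for y in range(oh):
        for x in range(ow):
            acc = 0.0
            # Each output element reads a kh x kw window of the input;
            # neighbouring outputs reuse most of that window, which is
            # why memory access patterns matter for optimized kernels.
            for i in range(kh):
                for j in range(kw):
                    acc += image[y + i][x + j] * kernel[i][j]
            out[y][x] = acc
    return out
```

Calling it on a 3x3 image with a 2x2 kernel yields a 2x2 output, one value per window position.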

arXiv.org e-Print Archive

Crossref

Neuron Segmentation Using Deep Complete Bipartite Networks

Author: Banerjee Sreya
Chen Danny Z.
Chen Jianxu
Grama Abhinav
Scheirer Walter J.
Publication date: 31/05/2017

In this paper, we consider the problem of automatically segmenting neuronal cells in dual-color confocal microscopy images. This problem is a key task in various quantitative analysis applications in neuroscience, such as tracing cell genesis in Danio rerio (zebrafish) brains. Deep learning, especially using fully convolutional networks (FCN), has profoundly changed segmentation research in biomedical imaging. We face two major challenges in this problem. First, neuronal cells may form dense clusters, making it difficult to correctly identify all individual cells (even to human experts). Consequently, segmentation results of the known FCN-type models are not accurate enough. Second, pixel-wise ground truth is difficult to obtain. Only a limited amount of approximate instance-wise annotation can be collected, which makes the training of FCN models quite cumbersome. We propose a new FCN-type deep learning model, called deep complete bipartite networks (CB-Net), and a new scheme for leveraging approximate instance-wise annotation to train our pixel-wise prediction model. Evaluated using seven real datasets, our proposed new CB-Net model outperforms the state-of-the-art FCN models and produces neuron segmentation results of remarkable quality. Comment: miccai 201

arXiv.org e-Print Archive

Crossref

Dir-MUSIC Algorithm for DOA Estimation of Partial Discharge Based on Signal Strength represented by Antenna Gain Array Manifold

Author: Chen Bingshu
Hu Yue
Li Jianxu
Li Yandong
Xu Wencong
Zeng Zijing
Publication date: 19/04/2022

Inspection robots are widely used for smart grid monitoring in substations, and partial discharge (PD) is an important indicator of the insulation state of equipment. PD direction of arrival (DOA) algorithms using conventional beamforming and time difference of arrival (TDOA) require large-scale antenna arrays and have high computational complexity, which makes them difficult to implement on inspection robots. To address this problem, a novel directional multiple signal classification (Dir-MUSIC) algorithm for PD direction finding based on signal strength is proposed, and a miniaturized directional spiral antenna circular array is designed in this paper. First, the Dir-MUSIC algorithm is derived based on the array manifold characteristics. This method uses signal strength information rather than TDOA information, which could reduce the computational difficulty and the required array size. Second, the effects of signal-to-noise ratio (SNR) and array manifold error on the performance of the algorithm are discussed in detail through simulations. Then, according to the positioning requirements, the antenna array and its arrangement are developed and optimized; simulation results suggest that the algorithm has reliable direction-finding performance with a 6-element array. Finally, the effectiveness of the algorithm is tested using the designed spiral circular array in real scenarios. The experimental results show that the PD direction-finding error is 3.39°, which can meet the need for partial discharge DOA estimation using inspection robots in substations. Comment: 8 pages, 13 figures, 24 references
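To illustrate the general idea of finding a direction from per-element signal strength rather than arrival times, the following sketch searches for the angle whose expected gain pattern best matches the measured strengths. This is a simple normalized-correlation search, a stand-in technique and not the Dir-MUSIC algorithm itself. The cardioid gain model, the evenly spaced boresights, and the names `gain_vector` and `estimate_direction` are hypothetical choices for illustration only.

```python
import math


def gain_vector(theta_deg, n_elements=6):
    """Assumed cardioid gain of each array element toward theta_deg.

    Element k is assumed to point at 360*k/n_elements degrees.
    """
    gains = []
    for k in range(n_elements):
        boresight = 360.0 * k / n_elements
        diff = math.radians(theta_deg - boresight)
        gains.append(0.5 * (1.0 + math.cos(diff)))
    return gains


def estimate_direction(strengths, step_deg=1):
    """Return the grid angle whose gain vector best matches the strengths."""
    norm_s = math.sqrt(sum(s * s for s in strengths))
    if norm_s == 0.0:
        raise ValueError("all measured strengths are zero")

    best_angle, best_score = 0, -1.0
    for angle in range(0, 360, step_deg):
        g = gain_vector(angle, len(strengths))
        norm_g = math.sqrt(sum(v * v for v in g))
        # Normalized correlation: equals 1 only when the measured
        # strengths are proportional to the candidate gain vector.
        score = sum(s * v for s, v in zip(strengths, g)) / (norm_s * norm_g)
        if score > best_score:
            best_angle, best_score = angle, score
    return best_angle
```

Because the score is normalized, the estimate does not depend on the absolute signal level, only on the relative strengths across the elements.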

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation

Author: Banerjee Sweta
Chen Jianxu
Dörr Stefanie
Grüneboom Anika
Lorenz Kristina
Sonneck Justin
Zhou Yu
Publication date: 09/06/2023

Artificial intelligence (AI) is now widely used in bioimage analysis, but the efficiency of AI models, such as their energy consumption and latency, can no longer be ignored due to the growing model size and complexity, as well as the fast-growing analysis needs in modern biomedical studies. Just as we can compress large images for efficient storage and sharing, we can also compress AI models for efficient application and deployment. In this work, we present EfficientBioAI, a plug-and-play toolbox that can compress given bioimaging AI models so that they run with significantly reduced energy cost and inference time on both CPU and GPU, without compromising accuracy. In some cases, the prediction accuracy could even increase after compression, since the compression procedure could remove redundant information in the model representation and therefore reduce over-fitting. From four different bioimage analysis applications, we observed around 2-5 times speed-up during inference and 30-80% saving in energy. Cutting the runtime of large-scale bioimage analysis from days to hours, or getting a two-minute bioimaging AI model inference done in near real-time, will open new doors for method development and biomedical discoveries. We hope our toolbox will facilitate resource-constrained bioimaging AI and accelerate large-scale AI-based quantitative biological studies in an eco-friendly way, as well as stimulate further research on the efficiency of bioimaging AI. Comment: 17 pages, 6 figures
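As a rough illustration of what compressing a model can mean, the sketch below applies symmetric 8-bit quantization to a list of weights. This is one common compression technique, shown here as an assumption; the abstract does not state which methods the toolbox uses. The names `quantize_int8` and `dequantize` are hypothetical and are not part of EfficientBioAI.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization of floats to the range [-127, 127].

    Returns the integer codes and the scale needed to map them back.
    """
    max_abs = max(abs(w) for w in weights)
    # Avoid division by zero for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float weights."""
    return [c * scale for c in codes]
```

Each weight is stored in one byte instead of four, at the cost of a rounding error of at most half the scale per weight.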

arXiv.org e-Print Archive

Combining Fully Convolutional and Recurrent Neural Networks for 3D Biomedical Image Segmentation

Author: Danny Z Chen
Jianxu Chen
Lin Yang
Mark Alber
Yizhe Zhang
Publication date: 10/04/2020

Segmentation of 3D images is a fundamental problem in biomedical image analysis. Deep learning (DL) approaches have achieved state-of-the-art segmentation performance. To exploit the 3D contexts using neural networks, known DL segmentation methods, including 3D convolution, 2D convolution on planes orthogonal to 2D image slices, and LSTM in multiple directions, all suffer from incompatibility with the highly anisotropic dimensions in common 3D biomedical images. In this paper, we propose a new DL framework for 3D image segmentation, based on a combination of a fully convolutional network (FCN) and a recurrent neural network (RNN), which are responsible for exploiting the intra-slice and inter-slice contexts, respectively. To the best of our knowledge, this is the first DL framework for 3D image segmentation that explicitly leverages 3D image anisotropism. Evaluated using a dataset from the ISBI Neuronal Structure Segmentation Challenge and in-house image stacks for 3D fungus segmentation, our approach achieves promising results compared to the known DL-based 3D segmentation approaches.

CiteSeerX

AAU-Net: an Adaptive Attention U-Net for breast lesions segmentation in ultrasound images

Author: Chen Gongping
Dai Yu
Li Lei
Yap Moi Hoon
Zhang Jianxu
Publication venue: Institute of Electrical and Electronics Engineers
Publication date: 01/05/2023

Various deep learning methods have been proposed to segment breast lesions from ultrasound images. However, similar intensity distributions, variable tumor morphologies and blurred boundaries present challenges for breast lesions segmentation, especially for malignant tumors with irregular shapes. Considering the complexity of ultrasound images, we develop an adaptive attention U-net (AAU-net) to segment breast lesions automatically and stably from ultrasound images. Specifically, we introduce a hybrid adaptive attention module (HAAM), which mainly consists of a channel self-attention block and a spatial self-attention block, to replace the traditional convolution operation. Compared with the conventional convolution operation, the design of the hybrid adaptive attention module can help us capture more features under different receptive fields. Different from existing attention mechanisms, the HAAM module can guide the network to adaptively select more robust representation in channel and space dimensions to cope with more complex breast lesions segmentation. Extensive experiments with several state-of-the-art deep learning segmentation methods on three public breast ultrasound datasets show that our method has better performance on breast lesions segmentation. Furthermore, robustness analysis and external experiments demonstrate that our proposed AAU-net has better generalization performance in the breast lesion segmentation. Moreover, the HAAM module can be flexibly applied to existing network frameworks. The source code is available on https://github.com/CGPxy/AAU-net

E-space: Manchester Metropolitan University's Research Repository

Development of a Non-invasive Deep Brain Stimulator With Precise Positioning and Real-Time Monitoring of Bioimpedance

Author: Chen Duanduan
Gongyao Guo
Li Chunlin
Shi Yue
Shi Zhongyan
Sun Weiqian
Wang Heng
Wang Jing
Wu Jinglong
Xu Yifei
Yang Ruoshui
Zhang Jianxu
Publication venue: Frontiers Media SA
Publication date: 08/12/2020

Methods by which to achieve non-invasive deep brain stimulation via temporally interfering with electric fields have been proposed, but the precision of the positioning of the stimulation and the reliability and stability of the outputs require improvement. In this study, a temporally interfering electrical stimulator was developed based on a neuromodulation technique using the interference modulation waveform produced by several high-frequency electrical stimuli to treat neurodegenerative diseases. The device and auxiliary software constitute a non-invasive neuromodulation system. The technical problems related to the multichannel high-precision output of the device were solved by an analog phase accumulator and a special driving circuit to reduce crosstalk. The function of measuring bioimpedance in real time was integrated into the stimulator to improve effectiveness. Finite element simulation and phantom measurements were performed to find the functional relations among the target coordinates, current ratio, and electrode position in the simplified model. Then, an appropriate approach was proposed to find electrode configurations for desired target locations in a detailed and realistic mouse model. A mouse validation experiment was carried out under the guidance of a simulation, and the reliability and positioning accuracy of temporally interfering electric stimulators were verified. Stimulator improvement and precision positioning solutions promise opportunities for further studies of temporally interfering electrical stimulation.
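The interference modulation waveform mentioned above can be illustrated with the simplest case of two equal-amplitude sinusoids at slightly different frequencies, whose sum has a slow envelope at the difference frequency. The sketch below is a minimal numerical illustration under that two-source assumption. The function names and the example frequencies (2000 Hz and 2010 Hz) are hypothetical and are not taken from the study.

```python
import math


def interference_signal(t, f1, f2, amplitude=1.0):
    """Sum of two equal-amplitude sinusoids at frequencies f1 and f2 (Hz)."""
    return amplitude * (
        math.sin(2.0 * math.pi * f1 * t) + math.sin(2.0 * math.pi * f2 * t)
    )


def envelope(t, f1, f2, amplitude=1.0):
    """Slow envelope of the summed signal.

    From sin(a) + sin(b) = 2 sin((a + b) / 2) cos((a - b) / 2), the sum
    is a fast carrier scaled by 2 * amplitude * |cos(pi * (f1 - f2) * t)|,
    which repeats at the difference frequency |f1 - f2|.
    """
    return 2.0 * amplitude * abs(math.cos(math.pi * (f1 - f2) * t))


# Example with assumed frequencies: the envelope repeats at 10 Hz.
beat_hz = abs(2000.0 - 2010.0)
```

With these example values the envelope peaks at t = 0 and falls to zero at t = 0.05 s, half a beat period later.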

Okayama University Scientific Achievement Repository